
***********************************************
Replication files for

Voicing disagreement in science: Missing women

Review of Economics and Statistics

Author: David Klinowski
***********************************************


ORGANIZATION

The files to reproduce the results are organized in three folders:

1. "disagreement_restat_code": contains a stata do file for each figure and table in the main text and the online appendix, as well as a "_rundirectory.do" file that describes and executes the entire set of do files.

2. "disagreement_restat_data": the "output" subdirectory contains stata dta files that are called by the do files in (1) to produce the results, and the "temporary" subdirectory contains stata dta files created by the do files in (1) in the process of producing the results.

3. "disagreement_restat_results": contains each figure and table in the main text and the online appendix, which are generated by the code in (1). It also contains stata markdown files that output into a word document each table generated by the code in (1).

In addition, the experiment instructions are included in the document "disagreement_restat_experiment_instructions.pdf"



EXECUTION OF CODE

To reproduce the results, follow these steps:

1. Open "disagreement_restat_code/_rundirectory.do"

2. Type in your directory path into line 11: global project // ** TYPE PROJECT DIRECTORY HERE **

3. Run "disagreement_restat_code/_rundirectory.do"

This will run each do file in the "disagreement_restat_code" directory and reproduce all the figures and tables in the main text and the online appendix, except for Table A13 and Figures A5-A6, which use proprietary citations data from Clarivate Analytics Web of Science and are therefore not included in these files.  Clarivate Analytics Web of Science is a paid-access platform that I accessed under a license to Stanford University. Anyone who pays for a license would be able to obtain the data. For more information, see: https://clarivate.com/webofsciencegroup/solutions/web-of-science/



DESCRIPTION OF DATASETS AND DICTIONARY OF VARIABLES

1. "aer_data_gender.dta"

journal         		name of journal                  
html_file       		scraped html file number                 
article_id      		id of paper                 
article_id_chronological 	another id of paper that goes in chronological order of publication                 
author_id       		id of author within paper                 
volume          		volume of publication                 
issue           		issue of publication                 
year            		year of publication                 
month           		month of publication              
month_rev       		month of publication                   
day             		day of publication                 
type            		paper type                 
title           		paper title                 
full_name       		full name of author                  
first_name      		first name of author                  
month_num       		month in number                 
female_ssa      		female indicator based on social security administration birth records                 
male_ssa        		male indicator based on social security administration birth records                 
female_genderize		female indicator based on genderize.io                 
male_genderize  		male indicator based on genderize.io              
comment         		indicator that paper is comment                 
reply           		indicator that paper is a reply to a comment                 
letter          		indicator that paper is a comment or a reply to a comment                 
research_article 		indicator that paper is a research article                 
female_manual   		female indicator based on visual inspection of name                 
call_to         		article_id of paper criticized by this comment 
call_by         		article_id of comment to this paper
year_call_to    		year of publication of paper criticized by this comment
jel_classification		jel_classification
jel_*				variables to parse jel codes
field_microeconomics		indicator that paper is in microeconomics                 
field_theory    		indicator that paper is in theory                 
field_macroeconomics 		indicator that paper is in macroeconomics                 
field_labor     		indicator that paper is in labor                 
field_econometrics 		indicator that paper is in econometrics                 
field_io        		indicator that paper is in io                 
field_international 		indicator that paper is in international economics                 
field_finance   		indicator that paper is in finance                 
field_public    		indicator that paper is in public economics                 
field_health_urban 		indicator that paper is in health or urban                 
field_development 		indicator that paper is in development                 
field_history  			indicator that paper is in history                 
field_lab       		indicator that paper is in lab experiments                 
field_other     		indicator that paper is in other field                 

---

2. "aer_fields.dta"

field_number    		1-microeconomics, 2-theory, 3- macroeconomics, 4-labor, 5-econometrics, 6-industrial organization, 7-international 				economics, 8-finance, 9-public economics, 10-health or urban economics, 11-development, 12-history, 13-laboratory 				experiments, and 14-other                
fraction_articles 		share of all research papers in field                 
fraction_comments		share of all comments in field                 
fraction_female 		share of female authors in field

---

3. "all_aea_data_nogender.dta"

journal         		name of journal                  
html_file       		scraped html file number                 
article_id      		id of paper                 
article_id_chronological 	another id of paper that goes in chronological order of publication                 
author_id       		id of author within paper                 
volume          		volume of publication                 
issue           		issue of publication                 
year            		year of publication                 
month           		month of publication              
month_rev       		month of publication                   
day             		day of publication                 
type            		paper type                 
title           		paper title                 
full_name       		full name of author                  
first_name      		first name of author                  
month_num       		month in number                 

---

4. "asr_data_gender.dta"

journal         		name of journal                  
html_file       		scraped html file number                 
article_id      		id of paper                 
author_id       		id of author within paper                 
volume          		volume of publication                 
issue           		issue of publication                 
year            		year of publication                 
month           		month of publication              
day             		day of publication
type            		paper type                 
title           		paper title                 
full_name       		full name of author                  
first_name      		first name of author                  
female_ssa      		female indicator based on social security administration birth records                 
male_ssa        		male indicator based on social security administration birth records                 
female_genderize		female indicator based on genderize.io                 
male_genderize  		male indicator based on genderize.io              
letter          		indicator that paper is a comment or a reply to a comment                 
comment         		indicator that paper is comment                 
reply           		indicator that paper is a reply to a comment                 
research_article 		indicator that paper is a research article                 
call_to         		article_id of paper criticized by this comment 
call_by         		article_id of comment to this paper
year_call_to    		year of publication of paper criticized by this comment
month_call_to			month of publication of paper criticized by this comment

---

5. "biorxiv_gender.dta"

subject_id      		field of paper                                              
results_type_id 		type of paper                                         
html_file       		scraped html file number                 
article_id      		id of paper                 
author_id       		id of author within paper                 
doi             		doi of paper                  
year            		year paper was posted on biorxiv                
title           		paper title                   
sur_name        		last name of author                  
given_name      		given name of author                  
first_name      		first name of author                  
female_ssa      		female indicator based on social security administration birth records                 
male_ssa        		male indicator based on social security administration birth records                 
female_genderize		female indicator based on genderize.io                 
male_genderize  		male indicator based on genderize.io              

---

6. "contest_experiment.dta" (selected variables defined)

sessionconfigname			"enforce": contest treatment, "distributional_prefs": deduct-a-$1 treatment
intro1playertreatment			"plain": main treatment, "conciliatory": 50-50 treatment, "aggressive": take-$1 treatment
intro1playerprolific_pid       		participant prolific id, deleted for confidentiality
intro1playercorrect_table  		participant solves task correctly
intro1playercorrect_table_extra 	participant solves task correctly in additional incentivized round
intro1playerbelief_self  		beliefs of chance of solving the task correctly
intro1playerbelief_others       	belief of how many 20 other participants solved the task correctly
questionnaire1playerfemale		1: female, 2: male, 3: other, 4: prefer not to say
questionnaire1playerage			1: 18-25, 2: 26-35, 3: 36-45, 4: 46-55, 5: 56-65, 6: 66 or older, 7: prefer not to say 
questionnaire1playerresidence		1: Northeast, 2: South, 3: Midwest, 4: West, 5: prefer not to say
questionnaire1playerrace_asian		1: No, 2: Yes, 3: prefer not to say
questionnaire1playerrace_black		1: No, 2: Yes, 3: prefer not to say
questionnaire1playerrace_latinx		1: No, 2: Yes, 3: prefer not to say
questionnaire1playerrace_white		1: No, 2: Yes, 3: prefer not to say
questionnaire1playereducation		1: less than high school degree, 2: high school degree or equivalent (for example, GED), 3: some college but no degree, 4: associate degree, 5: bachelor degree, 6: graduate degree, 7: prefer not to say
questionnaire1playerhigh_school_	Participant attended high school in the US, 1: No, 2: Yes, 3: prefer not to say
questionnaire1playerdifficult_in	How difficult participant found the instructions, 1: extremely easy, 2: moderately easy, 3: slightly easy, 4: neither easy nor difficult, 5: slightly difficult, 6: moderately difficult, 7: extremely difficult
intro1playerbelief_treatment		Contingency probability treatment in the deduct-a-$1 treatment, "center": (0.5,0.5) and (0.5,0.25,0.25) , "extreme": (0.8,0.2) and (0.76,0.12,0.12)
age					participant age from prolific profile
num_approvals				participant number of approvals from prolific profile
sex					participant sex from prolific profile
enforce					indicator for contest treatment
plain					indicator for main treatment (as opposed to treatments with incentives to contest)
conciliatory				indicator for 50-50 treatment
aggressive				indicator for take-$1 treatment
treatment				1: main treatment, 2: 50-50 treatment, 3: take-$1 treatment
belief_treatment_center			0: contingency probabilities (0.8,0.2) and (0.76,0.12,0.12), 1: contingency probabilities (0.5,0.5) and (0.5,0.25,0.25)
female 					female indicator from participant prolific profile
race 					1: asian, 2: black, 3: latinx, 4: white, 5: mixed or other
race_asian     				indicator for race = 1                  
race_black                      	indicator for race = 2
race_latino                     	indicator for race = 3
race_white                      	indicator for race = 4
race_multiple~r                 	indicator for race = 5
northeast                       	indicator for Northeast residence
south                           	indicator for South residence
midwest                         	indicator for Midwest residence
west                            	indicator for West residence
residence                       	1: Northeast, 2: South, 3: Midwest, 4: West. Subjects who answered "prefer not to say" were randomized into a residence category using sample residence distribution as weights
ageb                            	participant age bin based on "age" variable, 1: 18-20, 2: 21-23, 3: 24-27, 4: 28 or older
edub                            	participant education bin based on "questionnaire1playereducation" variable, 1: 1-2, 2: 2, 3: 4-5 
hs_or_less                      	indicator for edub = 1
some_college_but_no_degree		indicator for edub = 2                 
college_degree_or_more			indicator for edub = 3                 
hs_in_us                        	indicator for questionnaire1playerhigh_school_ = 2
experience                      	number of previous approvals on prolific 1: 0-10, 2: 11-20, 3: 21 or more  
indifferent                     	within contest treatment, indicator of potential indifference in the choice to contest, 1: intro1playerbelief_self = 0 or intro1playerbelief_others = 20 
participantcode_num			participant id, numeric

---

7. "jama_pubmed_data_gender.dta"

journal         		name of journal                  
html_file       		scraped html file number                 
article_id      		id of paper                 
author_id       		id of author within paper                 
volume          		volume of publication                 
issue           		issue of publication                 
year            		year of publication                 
month           		month of publication              
day  				day of publication                
type            		paper type                 
title           		paper title                 
full_name       		full name of author                  
first_name      		first name of author                  
female_ssa      		female indicator based on social security administration birth records                 
male_ssa        		male indicator based on social security administration birth records                 
female_genderize		female indicator based on genderize.io                 
male_genderize  		male indicator based on genderize.io              
letter          		indicator that paper is a comment or a reply to a comment                 
reply           		indicator that paper is a reply to a comment                 
comment         		indicator that paper is comment                 
research_article 		indicator that paper is a research article                 
pubmed_article_id      		leftover variable from dataset construction
merge_jama_pubmed		leftover variable from dataset construction
pubmed_journal    		name of journal in pubmed data   
pubmed_html_file		scraped html file from pubmed       		
pubmed_author_id		id of author within paper in pubmed     
pubmed_year          		year of publication in pubmed
pubmed_month         		month of publication in pubmed
pubmed_day           		day of publication in pubmed
pubmed_title         		paper title in pubmed
article_with_etal 		indicator of list of authors truncated with "et al." in eTOC

--

8. "nature_data_gender.dta"

journal         		name of journal                  
html_file       		scraped html file number                 
article_id      		id of paper                 
author_id       		id of author within paper                 
volume          		volume of publication                 
issue           		issue of publication                 
year            		year of publication                 
month           		month of publication              
day  				day of publication                
type            		paper type                 
title           		paper title                 
full_name       		full name of author                  
first_name      		first name of author                  
female_ssa      		female indicator based on social security administration birth records                 
male_ssa        		male indicator based on social security administration birth records                 
female_genderize		female indicator based on genderize.io                 
male_genderize  		male indicator based on genderize.io              
comment         		indicator that paper is comment                 
reply           		indicator that paper is a reply to a comment                 
letter          		indicator that paper is a comment or a reply to a comment                 
research_article 		indicator that paper is a research article                 
call_to         		article_id of paper criticized by this comment 
call_by         		article_id of comment to this paper
year_call_to    		year of publication of paper criticized by this comment
month_call_to   		month of publication of paper criticized by this comment
correction      		type of correction issued to the paper: addendum, correction, corrigendum, erratum, notice, retraction
date_correction 		date of correction

---

9. "nature_fields_matched_ids.dta"

article_id      		id of paper                 
biological      		indicator for paper in the biological sciences
earth           		indicator for paper in the earth sciences
health          		indicator for paper in the health sciences
physical        		indicator for paper in the physical sciences
social          		indicator for paper in the social sciences

---

10. "pnas_data_gender.dta"

journal         		name of journal                  
html_file       		scraped html file number                 
article_id      		id of paper                 
author_id       		id of author within paper                 
volume          		volume of publication                 
issue           		issue of publication                 
year            		year of publication                 
month           		month of publication              
day  				day of publication                
type            		paper type                 
subfield        		subfield of the paper                 
title           		paper title                 
full_name       		full name of author                  
first_name      		first name of author                  
female_ssa      		female indicator based on social security administration birth records                 
male_ssa        		male indicator based on social security administration birth records                 
female_genderize		female indicator based on genderize.io                 
male_genderize  		male indicator based on genderize.io              
letter          		indicator that paper is a comment or a reply to a comment                 
reply           		indicator that paper is a reply to a comment                 
comment         		indicator that paper is comment                 
research_article 		indicator that paper is a research article                 
call_to         		article_id of paper criticized by this comment 
call_by         		article_id of comment to this paper
year_call_to    		year of publication of paper criticized by this comment
month_call_to   		month of publication of paper criticized by this comment

---

11. "pnas_editors_match_to_authors.dta"

article_author_id 		id of paper-author observation                
editor_id       		id of editor in the mastheads scrape                
editor_name     		full name of editor in masthead                  
is_editor       		indicator for being an editor of pnas                  
earliest_volume_as_editor 	earliest volume in which name is listed as editor in mastheads 

---

12. "science_data_gender.dta"

journal         		name of journal                  
html_file       		scraped html file number                 
article_id      		id of paper                 
author_id       		id of author within paper                 
volume          		volume of publication                 
issue           		issue of publication                 
year            		year of publication                 
month           		month of publication              
day  				day of publication                
type            		paper type                 
title           		paper title                 
full_name       		full name of author                  
first_name      		first name of author                  
female_ssa      		female indicator based on social security administration birth records                 
male_ssa        		male indicator based on social security administration birth records                 
female_genderize		female indicator based on genderize.io                 
male_genderize  		male indicator based on genderize.io              
letter          		indicator that paper is a comment or a reply to a comment                 
reply           		indicator that paper is a reply to a comment                 
comment         		indicator that paper is comment                 
research_article 		indicator that paper is a research article                 
call_to         		article_id of paper criticized by this comment 
call_by_1       		article_id of 1st comment to this paper
call_by_2       		article_id of 2nd comment to this paper
call_by_3       		article_id of 3rd comment to this paper
call_by_4       		article_id of 4th comment to this paper
call_by_5       		article_id of 5th comment to this paper
call_by_6       		article_id of 6th comment to this paper
call_by_7       		article_id of 7th comment to this paper
call_by_8       		article_id of 8th comment to this paper

---

13. "science_editors_match_to_authors.dta"

article_author_id 		id of paper-author observation                
editor_id       		id of editor in the mastheads scrape                
editor_name     		full name of editor in masthead                  
is_editor       		indicator for being an editor of pnas                  
earliest_volume_as_editor 	earliest volume in which name is listed as editor in mastheads 





